Ultrafast shape recognition for similarity search in molecular databases
نویسندگان
چکیده
Molecular databases are routinely screened for compounds that most closely resemble a molecule of known biological activity to provide novel drug leads. It is widely believed that three-dimensional molecular shape is the most discriminating pattern for biological activity as it is directly related to the steep repulsive part of the interaction potential between the drug-like molecule and its macromolecular target. However, efficient comparison of molecular shape is currently a challenge. Here, we show that a new approach based on moments of distance distributions is able to recognize molecular shape at least three orders of magnitude faster than current methodologies. Such an ultrafast method permits the identification of similarly shaped compounds within the largest molecular databases. In addition, the problematic requirement of aligning molecules for comparison is circumvented, as the proposed distributions are independent of molecular orientation. Our methodology could be also adapted to tackle similar hard problems in other fields, such as designing content-based Internet search engines for three-dimensional geometrical objects or performing fast similarity comparisons between proteins. From a broader perspective, we anticipate that ultrafast pattern recognition will soon become not only useful, but also essential to address the data explosion currently experienced in most scientific disciplines.
منابع مشابه
Ultrafast shape recognition to search compound databases for similar molecular shapes
Finding a set of molecules, which closely resemble a given lead molecule, from a database containing potentially billions of chemical structures is an important but daunting problem. Similar molecular shapes are particularly important, given that in biology small organic molecules frequently act by binding into a defined and complex site on a macromolecule. Here, we present a new method for mol...
متن کامل3D Face Recognition using Patch Geodesic Derivative Pattern
In this paper, a novel Patch Geodesic Derivative Pattern (PGDP) describing the texture map of a face through its shape data is proposed. Geodesic adjusted textures are encoded into derivative patterns for similarity measurement between two 3D images with different pose and expression variations. An extensive experimental investigation is conducted using the publicly available Bosphorus and BU-3...
متن کاملUSRCAT: real-time ultrafast shape recognition with pharmacophoric constraints
UNLABELLED BACKGROUND Ligand-based virtual screening using molecular shape is an important tool for researchers who wish to find novel chemical scaffolds in compound libraries. The Ultrafast Shape Recognition (USR) algorithm is capable of screening millions of compounds and is therefore suitable for usage in a web service. The algorithm however is agnostic of atom types and cannot discrimina...
متن کاملA Partial Shape Matching Method for 3d Model Databases
The use of 3D models is gaining popularity since they are important for computer graphics applications. Recently, similarity retrieval techniques for 3D models have been investigated intensively for handling databases of 3D models systematically. The techniques extract shape descriptors from 3D models and use these descriptors for indices for comparing shape similarities. Various shape descript...
متن کاملUSR-VS: a web server for large-scale prospective virtual screening using ultrafast shape recognition techniques
Ligand-based Virtual Screening (VS) methods aim at identifying molecules with a similar activity profile across phenotypic and macromolecular targets to that of a query molecule used as search template. VS using 3D similarity methods have the advantage of biasing this search toward active molecules with innovative chemical scaffolds, which are highly sought after in drug design to provide novel...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007